46 research outputs found

    A global gene evolution analysis on Vibrionaceae family using phylogenetic profile

    Get PDF
    <p>Abstract</p> <p>Background</p> <p><it>Vibrionaceae </it>represent a significant portion of the cultivable heterotrophic sea bacteria; they strongly affect nutrient cycling and some species are devastating pathogens.</p> <p>In this work we propose an improved phylogenetic profile analysis on 14 <it>Vibrionaceae </it>genomes, to study the evolution of this family on the basis of gene content.</p> <p>The phylogenetic profile is based on the observation that genes involved in the same process (e.g. metabolic pathway or structural complex) tend to be concurrently present or absent within different genomes. This allows the prediction of hypothetical functions on the basis of a shared phylogenetic profiles. Moreover this approach is useful to identify putative laterally transferred elements on the basis of their presence on distantly phylogenetically related bacteria.</p> <p>Results</p> <p><it>Vibrionaceae </it>ORFs were aligned against all the available bacterial proteomes. Phylogenetic profile is defined as an array of distances, based on aminoacid substitution matrixes, from single genes to all their orthologues. Final phylogenetic profiles, derived from non-redundant list of all ORFs, was defined as the median of all the profiles belonging to the cluster. The resulting phylogenetic profiles matrix contains gene clusters on the rows and organisms on the columns.</p> <p>Cluster analysis identified groups of "core genes" with a widespread high similarity across all the organisms and several clusters that contain genes homologous only to a limited set of organisms. On each of these clusters, COG class enrichment has been calculated. The analysis reveals that clusters of core genes have the highest number of enriched classes, while the others are enriched just for few of them like DNA replication, recombination and repair.</p> <p>Conclusion</p> <p>We found that mobile elements have heterogeneous profiles not only across the entire set of organisms, but also within <it>Vibrionaceae</it>; this confirms their great influence on bacteria evolution even inside the same family. Furthermore, several hypothetical proteins highly correlate with mobile elements profiles suggesting a possible horizontal transfer mechanism for the evolution of these genes. Finally, we suggested the putative role of some ORFs having an unknown function on the basis of their phylogenetic profile similarity to well characterized genes.</p

    PABS: An online platform to assist BAC-by-BAC sequencing projects

    Get PDF
    Genome sequencing projects are either based on whole genome shotgun (WGS) or on a BAC-by-BAC strategy. Although WGS is in most cases the preferred choice, sometimes the BAC-by-BAC approach may be better because it requires a much simpler assembly process. Furthermore, when the study is limited to specific regions of the genome, the WGS would require an unjustified effort, making the BAC-by-BAC the only feasible strategy. In this paper we describe an informatics pipeline called PABS (Platform Assisted BAC-by-BAC Sequencing) that we developed to provide a tool to optimize the BAC-by-BAC sequencing strategy. PABS has two main functions: (i) PABS-Select, to choose suitable overlapping clones; and (ii) PABS-Validate, to verify whether a BAC under analysis is actually overlapping the neighboring BAC

    QueryOR: a comprehensive web platform for genetic variant analysis and prioritization

    Get PDF
    Background: Whole genome and exome sequencing are contributing to the extraordinary progress in the study of human genetic variants. In this fast developing field, appropriate and easily accessible tools are required to facilitate data analysis. Results: Here we describe QueryOR, a web platform suitable for searching among known candidate genes as well as for finding novel gene-disease associations. QueryOR combines several innovative features that make it comprehensive, flexible and easy to use. Instead of being designed on specific datasets, it works on a general XML schema specifying formats and criteria of each data source. Thanks to this flexibility, new criteria can be easily added for future expansion. Currently, up to 70 user-selectable criteria are available, including a wide range of gene and variant features. Moreover, rather than progressively discarding variants taking one criterion at a time, the prioritization is achieved by a global positive selection process that considers all transcript isoforms, thus producing reliable results. QueryOR is easy to use and its intuitive interface allows to handle different kinds of inheritance as well as features related to sharing variants in different patients. QueryOR is suitable for investigating single patients, families or cohorts. Conclusions: QueryOR is a comprehensive and flexible web platform eligible for an easy user-driven variant prioritization. It is freely available for academic institutions at http://queryor.cribi.unipd.it/

    Laterally transferred elements and high pressure adaptation in Photobacterium profundum strains

    Get PDF
    BACKGROUND: Oceans cover approximately 70% of the Earth's surface with an average depth of 3800 m and a pressure of 38 MPa, thus a large part of the biosphere is occupied by high pressure environments. Piezophilic (pressure-loving) organisms are adapted to deep-sea life and grow optimally at pressures higher than 0.1 MPa. To better understand high pressure adaptation from a genomic point of view three different Photobacterium profundum strains were compared. Using the sequenced piezophile P. profundum strain SS9 as a reference, microarray technology was used to identify the genomic regions missing in two other strains: a pressure adapted strain (named DSJ4) and a pressure-sensitive strain (named 3TCK). Finally, the transcriptome of SS9 grown under different pressure (28 MPa; 45 MPa) and temperature (4°C; 16°C) conditions was analyzed taking into consideration the differentially expressed genes belonging to the flexible gene pool. RESULTS: These studies indicated the presence of a large flexible gene pool in SS9 characterized by various horizontally acquired elements. This was verified by extensive analysis of GC content, codon usage and genomic signature of the SS9 genome. 171 open reading frames (ORFs) were found to be specifically absent or highly divergent in the piezosensitive strain, but present in the two piezophilic strains. Among these genes, six were found to also be up-regulated by high pressure. CONCLUSION: These data provide information on horizontal gene flow in the deep sea, provide additional details of P. profundum genome expression patterns and suggest genes which could perform critical functions for abyssal survival, including perhaps high pressure growth

    Direct 16S rRNA-seq from bacterial communities: a PCR-independent approach to simultaneously assess microbial diversity and functional activity potential of each taxon

    Get PDF
    The analysis of environmental microbial communities has largely relied on a PCR-dependent amplification of genes entailing species identity as 16S rRNA. This approach is susceptible to biases depending on the level of primer matching in different species. Moreover, possible yet-to-discover taxa whose rRNA could differ enough from known ones would not be revealed. DNA-based methods moreover do not provide information on the actual physiological relevance of each taxon within an environment and are affected by the variable number of rRNA operons in different genomes. To overcome these drawbacks we propose an approach of direct sequencing of 16S ribosomal RNA without any primer- or PCR-dependent step. The method was tested on a microbial community developing in an anammox bioreactor sampled at different time-points. A conventional PCR-based amplicon pyrosequencing was run in parallel. The community resulting from direct rRNA sequencing was highly consistent with the known biochemical processes operative in the reactor. As direct rRNA-seq is based not only on taxon abundance but also on physiological activity, no comparison between its results and those from PCR-based approaches can be applied. The novel principle is in this respect proposed not as an alternative but rather as a complementary methodology in microbial community studies

    The ocean sampling day consortium

    Get PDF
    Ocean Sampling Day was initiated by the EU-funded Micro B3 (Marine Microbial Biodiversity, Bioinformatics, Biotechnology) project to obtain a snapshot of the marine microbial biodiversity and function of the world’s oceans. It is a simultaneous global mega-sequencing campaign aiming to generate the largest standardized microbial data set in a single day. This will be achievable only through the coordinated efforts of an Ocean Sampling Day Consortium, supportive partnerships and networks between sites. This commentary outlines the establishment, function and aims of the Consortium and describes our vision for a sustainable study of marine microbial communities and their embedded functional traits

    The Ocean Sampling Day Consortium

    Get PDF
    Ocean Sampling Day was initiated by the EU-funded Micro B3 (Marine Microbial Biodiversity, Bioinformatics, Biotechnology) project to obtain a snapshot of the marine microbial biodiversity and function of the world’s oceans. It is a simultaneous global mega-sequencing campaign aiming to generate the largest standardized microbial data set in a single day. This will be achievable only through the coordinated efforts of an Ocean Sampling Day Consortium, supportive partnerships and networks between sites. This commentary outlines the establishment, function and aims of the Consortium and describes our vision for a sustainable study of marine microbial communities and their embedded functional traits

    The hydrobiological station of Chioggia: a platform to study the coasts and lagoons of the north-western Adriatic Sea

    No full text
    The Hydrobiological Station \u201cUmberto D\u2019Ancona\u201d (chioggia.biologia.unipd.it/en/), founded in 1941 in Chioggia, inside the Venetian Lagoon (northern Adriatic Sea), constitutes an historical field station for researches on marine and lagoon environments. Strongly linked to the local realities, the Station collected several data and built databases over time. Two continuous datasets are freely available and yearly updated. From 1970s data on air and water temperature, salinity, pH, dissolved oxygen are collected inside the Venetian Lagoon. On the other hand a database on monthly landings of several marine species, reported by the local fishery, is maintained starting from 1945. Ongoing researches on coasts include: the relationship between environmental characteristic and fish communities of hard substrates, the spatial and temporal variation in microbiological communities, the effects of climate changes (acidification and temperature) and pollutions on marine invertebrates, fish, microbes and algae, the genetic structure of marine organisms, the relationship between environment and behavior in marine fish. The Station constitutes also a reference for local fishermen for the early warning of invasive species. Here we propose to expand the researches to address the challenges faced by coastal areas including the monitoring of key species (defined as study species with other research institutions), over space and time as sentinel for an earlier management of ecosystem changes

    Measuring the loss of duplicated genes in plant genomes assembled by means of short reads

    No full text
    Introduction The assembly of a genome is a complex task, whose hardest step is the resolution of repeats. As these regions are usually considered of minor concern for the description of the features of the genome, they are often poorly characterized in a genome analysis. By definition, any region present at least twice in the genome is a repeat, therefore duplicated genes could fall in this category, leading to an underestimation of duplication events in genomes. This effect could be exacerbated in the k-mers based short reads assembly algorithms. Methods While working in gap closure experiments for the tomato genome it came out that some of the unplaced contigs were duplicated genes. To our knowledge this loss of duplicated genes has never been measured for plant genomes. For this reason, the Arabidopsis thaliana genome was used as a reference sequence to generate simulated paired-end Illumina reads, that were assembled with De Bruijn graph based algorithms. Moreover, short reads data of other publicly available Arabidopsis thaliana ecotypes were similarly assembled and compared to the corresponding reference guided assemblies. Results The comparison between the already published genome assemblies and the De Bruijn graph based assemblies allowed us to investigate duplicated genes in terms of: 1) how many genes are missing in the genomes; 2) how the k-mers lengths may affect the loss/presence of duplicated genes in the genomes; 3) highlight how the structure of the duplicated genes can be affected by differential degree of nucleotide conservation. Discussion All the eukaryotic genome projects are now performed by means of short reads production and assembly. The impact of the sequencing strategy on duplicated gene representativeness should produce new insight to be considered when studying plant genomes and their evolution
    corecore